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Abstract. Let E be an elliptic curve defined over a number field k. In this 
paper, we define the "global discrepancy" of a finite set Z C E(k) which in a 
precise sense measures how far the set is from being adelically equidistributed. 
We prove an upper bound for the global discrepancy of Z in terms of the 
average canonical height of points in Z. We deduce from this inequality a 
number of consequences. For example, we give a new and simple proof of 
the Szpiro-Ullmo-Zhang equidistribution theorem for elliptic curves. We also 
prove a non-archimedean version of the Szpiro-Ullmo-Zhang theorem which 
takes place on the Berkovich analytic space associated to E. We then prove 
some quantitative 'non-equidistribution' theorems for totally real or totally p- 
adic small points. The results for totally real points imply similar bounds for 
points defined over the maximal cyclotomic extension of a totally real field. 



1. Introduction 

Let k be a number field, and let E/k be an elliptic curve. We denote by h : 
E{k) — > K the Neron-Tate canonical height function on E, which is nonnegative 
and vanishes precisely on the torsion subgroup E(k)t OT . If Z C E{k) is a finite set 
of points, we define the canonical height of Z to be the average of the canonical 
heights of the points in Z, i.e., h(Z) = ^ ^2p e z M-^)- 

In 21 we wm define another quantity T)(Z) called the global discrepancy of Z; 
it is a positive real number which is defined as a sum of local discrepancies. The 
global discrepancy measures, in a certain precise sense, how far the points of Z are 
from being adelically equidistributed. 

We will give an upper bound for the global discrepancy T>(Z) in terms of the 
canonical height h(Z), and we will deduce from this inequality various results. 
For example, we will prove an adelic equidistribution theorem whose statement 
combines ingredients from both [SS] and jSJ. In order to state the result, we recall 
that for each place v S Mj., the theory of Berkovich [3] furnishes an analytic 
space -©Berk,!) which is compact, Hausdorff, and path-connected, and which contains 
E(C V ) as a dense subspace. When v is archimedean, £Bcrk,u = E(C), but in the 
non-archimedean case, i?Bork,-u is much larger than E(C V ). For each place v, there 
is a canonical probability measure fx v on EBerk,v which will be defined in ^5.21 With 
these definitions in mind, we have: 

Theorem 1. Let k be a number field, and let E/k be an elliptic curve. Fix a 
place v € Mfc and an embedding E(k) <^-> E(C V ) C i?Berk,u- Suppose that {P n } 
is a sequence of distinct points in E(k) such that h(P n ) — > 0, and let 5 n be the 
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Borel probability measure on EneA,v supported equally on the set Z n of Gal(k/k)- 
conjugates of P n . Then S n — > fj, v weakly on Eb C i±,v 

The archimedean case of Theorem Q] will be established in H5.ll and the non- 
archimedean case in H5.3I The proof of Theorem ^ in the archimedean case is 
more elementary than the one given in 28, , and is better suited to quantitative 
refinements. In the non-archimedean case, the result generalizes a theorem of [8], 
and again our proof is more elementary. On the other hand, the results of and 
some of the results of [S] apply to abelian varieties in general. 

We will also give a number of quantitative "non-equidistribution" results. Specif- 
ically, we will provide explicit upper bounds for |£'(L) tor | and lower bounds for 
liminfp g £(£) h(P) and inf^pj_^ h(P) when L is one of the following types of alge- 
braic extensions of k: 

(i) L is totally real, i.e., every embedding of L into C has image contained in 
R. 

(ii) L is totally p-adic for some prime number p, i.e., every embedding of L into 
Q p has image contained in Q p . 

(hi) L is the maximal cyclotomic extension of k (when k is totally real). 
The basic idea behind all of these applications of the height-discrepancy inequality 
is that the discrepancy of a finite subset Z C E(L) cannot be too small. It has been 
previously established (see [H3jE21) that \E(L) tm \ < oo and liminfp eB ( i ) h(P) > 
in cases (i) and (iii), so in these cases the novelty in our results is that our method 
of proof leads to completely explicit bounds. 

We now define the global discrepancy T)(Z) of a finite set Z = {Pi, . . . Pjy} C 
E(K), where K/k is a finite extension. The global discrepancy can be thought of 
as a "smoothing out" of the quantity 

A ^ = ^ E kPi-pj)- 

Note that A(Z) > 0, and that the parallelogram law furnishes the inequality 

(1) A(Z) < Ah{Z). 
Furthermore, we can decompose A(Z) as a sum 

v£M K L ^ J 

with 

(2) ^(Z) = ^ £ Xv(Pi-Pj), 

l<i,j<N 

where A„ is an appropriately normalized Neron local height function A„ : E(C v )\{0} 
R. However, the quantity A V (Z) can be negative at archimedean places and at 
non-archimedean places of bad reduction; this is closely related to the fact that the 
singularity of at the origin has forced us to remove the diagonal from (J2J. For 
the applications we have in mind, it is useful to work with a nonnegative variant 
of A V (Z), and thus of A{Z) as well, which can still be bounded explicitly in terms 
of h(Z). At an archimedean place v, Elkies (see ^^1, §VT) accomplishes this via 



SMALL POINTS ON ELLIPTIC CURVES 



3 



convolution with the heat kernel (see also Faltings JOi)- This gives a one-parameter 
family {Xt}t>o of smooth functions Af : E(C) — ► K, defined on all of ^(C), such 
that lim t ^o At = A w and 

l<i,j<N 

Although there is no canonical choice for the parameter t, the simplest choice (which 
gives nearly optimal estimates) is t = l/N. We thus define the archimedcan local 
discrepancy of a subset Z C E(C) to be 

D «( Z ) = jj* E ^( p i- p j)- 

\<i,j<N 

We will see that if {Z n } n >i is a sequence of finite subsets of E(C) such that 
linin^oo D v (Z n ) = 0, then the sequence of discrete probability measures 6 n sup- 
ported equally on the elements of Z n converges weakly to the normalized Haar 
measure /i on E(C). The archimedean local discrepancy can thus be thought of as 
a measure of how far a set Z is from being equidistributed in E(C). 

For non- archimedean places v, there is a simpler way to modify A„ in order to 
obtain a suitable nonnegative quantity. As we explain further in (see also jj]), 
for all P,Q £ E(C V ) with P ^ Q, there is a decomposition 

X v (P-Q)=i v (P,Q)+j v (P,Q), 

where i v (P,Q) is a nonnegative arithmetic intersection term which tends to +00 
as P — * Q and j v (P, Q) is a bounded term which has a natural interpretation in 
terms of the "skeleton" of E(C V ) (see ffH|) . For all P,Qe £(Cu), we set 

= {-- ,p ' <3) 

and 

A;(p-Q) = i;(p,Q)+ Ju (p,g), 

and then define the non- archimedean local discrepancy of a set Z C E(C V ) to be 

^( Z ) = A 7? E A :( p i--P;)- 

l<ij<V 

Note that if v is a place of good reduction, then D V (Z) — A V (Z). 

To help justify the definition of the non-archimedean local discrepancy, we will 
show that for each non-archimedean place v € Mk, there is a natural probability 
measure fi v on the Berkovich analytic space EB C rk,v such that if {Z n } n >i is a 
sequence of finite subsets of E(C V ) with linin^oo D v (Z n ) — 0, then the sequence of 
discrete probability measures 6„ supported equally on the elements of Z n converges 
weakly to fi v on i?Bcrk,D- Thus the non-archimedean local discrepancy can also be 
thought of as a quantitative measure of equidistribution (or non-equidistribution) 

in EBcrk,v- 

Finally, we state the main inequality linking the global discrepancy 

^) = E ^rf^) 

veu K L J 
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of a subset Z C E(k) with the canonical height of Z. (Here K is any finite extension 
of k such that Z C E(K) and E/K is semistable; one can show that T>(Z) does 
not depend on the choice of K .) Recall that 

Height-Discrepancy Inequality. Let Z = {Pi, . . . ,Pn} Q E(k) be a set of 
N distinct algebraic points. Then 

(3) T>(Z) < 4h(Z) + I (i log AT + ±h(j E ) + f) , 

where /i(j,e) is the logarithmic absolute Weil height of the j-invariant je of E/k. 

Note that as N — > oo, © gives the same asymptotic estimate as {Q. The proof 
of © will be given in Theorem 0D of AO 

The techniques used in this paper combine ideas from various sources. The use of 
the local sums A V (Z) in the context of the equidistribution of small points originates 
in Baker- Rumely pQ. A related approach for the multiplicative group G m occurs 
in Bombieri's paper which inspired the Fourier-theoretic discrepancy methods 
employed here. Bombieri's approach is a simplification of Bilu's in gj. We also note 
that a series of papers by Hindry-Silverman |17| . [T%| makes use of explicit lower 
bounds for the sums A V (Z) in order to obtain quantitative results concerning the 
heights of points in E(k). 

2. The Archimedean Local Discrepancy 

Throughout this section E/C is an elliptic curve with j-invariant je, and fi 
denotes the unit Haar measure on the compact group E(C). 

2.1. The Fourier Transform and Laplacian. Let F^ denote the dual group 
of E(C), that is the continuous homomorphisms of E(C) into the circle group 
T = {z e C | \z\ = 1}. Given / G L l (E(C), //), the Fourier transform / : T E -> C 
is defined by 

Hi) = [ f(P)W)d^P); 
Je(c) 

and similarly, given a signed Borel measure m on E(C), we define its Fourier- 
Stieltjes transform rh : Te — > C by 

m(i) = / y(P)dm(P). 

JE(C) 

As E(C) is compact, its dual Te is discrete, and thus each / e i 1 (_E(C),^) has a 
Fourier series 

(4) /(P) ~ E /(7)7(^). 

7er B 

We have equality in |0J provided the right-hand-side is absolutely convergent. 

Denote by S(E(C)) the space of smooth functions on the curve, and by S'(E(C)) 
its dual, the space of distributions. We define the Laplacian on E(C) as an operator 
A : S(E(C)) — > S(E(C)) as follows. Fix a complex uniformization 



(5) 



E(C) ~ C/L 
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for a normalized lattice L = Z + tZ (t = a + bi £ H) , and let z = x + yi € C be a 
complex variable. Now, given g 6 S(E(C)), we define Ag € S(E(C)) by 



(6) A5 = ^te + ^ 



% 2 



.9- 



It is straightforward to show that this definition does not depend on the choice of 
L in its homothety class, and thus A is well defined on the curve E(C). 

Given a uniformization (J5J, one can write down explicitly the characters 7 £ Te- 
In this case the dual group is parametrized by the lattice L itself; that is, for 
each lattice point u> = n\ + £ L, we have a character 7^ : C/L — > T given by 

(7) 7w (z) = e 2Ti(nin+n 2 r 2 ) j 

where r±, r-i are real variables, and z = x + yi = r\ + r 2 r £ C. In order to describe 
the effect of the Laplacian on characters, we first define a permutation u> 1— ► u>' of 
the lattice L via the formula 

(8) (rti + ji 2 t)' = ri2 - niT. 

Since r 2 = y/b and ri = 2 — ay/b, applying the Laplacian to the function 

7u) (s) = e 2^(«ir 1+n2 r 2 ) 

_ 27ri(nix+& _ (712 — nia)y) 

we find that 

(9) A 7 .(z) = -yl^M. 



2.2. The Neron Function. Let A : E(C) \ {O} -> R denote the Neron function, 
as defined and normalized in [27J, §VI.l and ^H], §11.5. The most concrete way to 
define A is in terms of a complex uniformization E(C) ~ C/L, where L = Z + tZ is 
a lattice and T = a|iiel. Let z = x + = r\ + r 2 r be a complex variable, with 
x,y,ri,r2 £ K, and let it = e 27 ™ 2 and q = e 27TlT . In this local coordinate system, 
the Neron function A : (C/L) \ {0} — > K is given by the formula 



(10) A(z) = - 




where B 2 (T) = T 2 - T + 1/6 is the second Bernoulli polynomial (cf. [27], §VI.3). 

A more conceptual definition of A can be given in terms of Green's functions. 
First, note that since A is continuous except for a logarithmic singularity at O, it 
is /z-integrable. Define distributions 5o, D^, Da\ £ S'(E(C)) by So(g) = g(0), 

DAx(g) = f \(P)Ag(P)d»(P), 
Je(c) 

and 

dm = I g(PW(P), 

JE(C) 

where g £ S{E{C)). 



6 



MATTHEW BAKER AND CLAYTON PETSCHE 



The following proposition, which is proved in §H, Theorem 5.1, says that A 
is the Green's function on E(C) with respect to the divisor (O). 1 

Proposition 1. We have 

(11) f \{PW{P) = o, 

Je(C) 

and 

(12) D AX = Dp- 5 . 

More generally, setting g(P, Q) = g Q (P) = X(P-Q), we have / g(P, Q) dfi(P)dfi(Q) 
and D&g Q = — 8q, which means that g(P,Q) is the unique normalized 
Arakelov-Green's function on E(C) with respect to n (see §11, 0, and £Q). 

It is evident from © that the characters of E(C) are eigenfunctions of the 
Laplacian. The following proposition shows that the eigenvalues are closely related 
to the Fourier coefficients of the Neron function. Wc let E Te denote the trivial 
character of E(C). 

Proposition 2. Let 7 6 Te be a character of E(C). Then 

A7 = -c 7 7, 

where 

_ |0 ifl = 70, 

7 \ l /Kl) ifl^lo- 
Moreover, we have A (7) > 0, with equality if and only i/7 = 7o- 

Proof. The fact that the characters are eigenfunctions for the Laplacian follows 
from (J5J, from which it also follows that c 7 > 0, with equality if and only if 
7 = 7o- Applying the distributional identity \12\ to a nontrivial character 7, we 
find J E{C) X(P)Aj{P)dfi{P) = -1. On the other hand, 



X(P)A 7 {P)dfx(P) = -c 7 / X(P) 1 (P)d t ,(P) 

E(C) JE(C) 

= -c 7 A(t"), 



where 7(F) = j(P) is the complex conjugate of 7 e Te- Therefore c 7 = 1/A(7). 
The proof of the proposition is now complete upon noting that A(7) = A (7), since 
A is real- valued. □ 

Given a complex uniformization (JSJ , we can now give an explicit formula for the 
Fourier coefficients of the Neron function. If w e L is a lattice point, and 7^ is the 
character given in (JJJ), then it follows from Proposition [3 and that 



(13) A( 7w ) 



if w = 

^2tt\u>'\ 2 ~ 



There are different conventions in the literature for what one means by a Green's function; in 
Lang's book 1191 . for example, the Green's function is twice the Neron function A, whereas others 
take the Green's function to be —A. See the remark on page 22 of 1191 . 
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2.3. The Heat Kernel. The Fourier series 

a(p) ~ J2 k^HP) 

7Gr E 

for the Neron function does not converge absolutely, due to the fact that it has 
a singularity at O. Following Elkies (cf. 19 , §VI), we sidestep this difficulty by 
convolving with the heat kernel 

g t (P) = ]T e- t AW 7 (P), 
7er E 



where t > is a parameter, and put 

X t (P)=X*g t (P) 



(14) Je 



KQ)m{P - QW(Q) 



= J2 KiY- t/Ki) i{py 

l£-T E \{lo} 

2.4. Discrepancy. Let Z = {Pi, . . . , P/v} C E(C) be a set of N points on the 
elliptic curve E(C). We define the discrepancy of this set to be 

l<i,j<N 

The following proposition shows that D(Z) is essentially a classical L 2 -type dis- 
crepancy, measuring the /i-uniform distribution of the set Z. 

Proposition 3. For each n > 1, let Z n C E(C) be a set of N n distinct points, and 
let 6 n denote the probability measure on E(C) that assigns a mass ofl/N n at each 
point of Z n . If D(Z n ) —>■ 0, then the sequence of measures S n converges weakly to 
Haar measure fi. 

Proof. In order to show that 5 n — > /i weakly, it suffices by Fourier inversion to show 
that o~ n {l) A*(7) — for all nontrivial characters 76^. By Parseval's formula 
and the fact that At( 7 o) = 0, we have 

D{Z n )= AT(7)I^(7)| 2 

(16) " — . 

7er E \{ 70 } 

Now, given a nontrivial character 7 £ Te we have At (7) / A( 7 ) > as t — > + , by 
(THJl . It follows that 

limsup{|(57 l (7)| 2 /^(^ l )} < limsup{l/AT( 7 )} 

= 1/A( 7 ) < +00. 

The desired limit <5«(7) — > now follows from the assumption that D(Z n ) — > 0. □ 

The following result is a quantitative refinement, for the special case of elliptic 
curves, of a general estimate due to N. Elkies (see P3, §VT, Theorem 5.1). Although 
similar quantitative estimates have been used frequently in the literature (see for 
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example ^2] and to our knowledge, the details underlying these estimates 

have never been published. For this reason, we include a proof of Proposition 0] as 
an appendix to this paper. 

Proposition 4. Let E/<C be an elliptic curve with j-invariant j e , and let 

Z = {P 1 ,...,P N }cE(C) 

be a set of N distinct points. Then D(Z) > 0, and 

(18) £ A(P,-P i )>^(Z)-^-glo 6+fc |-l^. 

l<i,j<N 

3. The Non-archimedean Local Discrepancy 

In this section, v denotes a non-archimedean place of a number field fc, k v denotes 
the completion of k with respect to v, and C v denotes the completion of the algebraic 
closure k v . We also let O v denote the ring of integers of C v and F„ its residue field. 
Finally, we let E/C v be an elliptic curve with j-invariant je- 

If \je\v < 1, then E extends to an abelian scheme over O v , i.e., there exists a 
smooth proper model £ over O v whose special fiber £ is an elliptic curve over F„. 
In this case we say that E has good reduction. 

On the other hand, if \je\v > L then by Tate's non-archimedean uniformization 
theory (see |23 §V, EH, ESI): there is an analytic isomorphism E(C V ) ~ C* v /q z , 
where q <E C* and \q\ v < 1. In this case, we say that E has (split) multiplicative 
reduction. 

3.1. The retraction homomorphism. Suppose first that E has good reduction, 
and let £ be a model for E over <D V . The special fiber £ of £ is an elliptic curve over 
Ft,, and there is a canonical surjective reduction map it : E(C V ) — > £(¥ v ) which is 
a group homomorphism. Abusing notation somewhat, we define 

(19) E = £(¥ v ) 

to be the image of the reduction homomorphism. 

Now suppose that E has multiplicative reduction. If P G E(C V ), we write u(P) 
for the image of P in C*Jq z under the Tate map E(C V ) ~ C*/(j z . We define the 
retraction homomorphism r : E(<C V ) — + K./Z by 

(20) r(P) = ^M^. 

We call the circle group R/Z the skeleton of £7; this terminology comes from 
Berkovich's theory of analytic spaces and will be discussed further in M5.2I 

Note that since \C*\ V = p®, the image of the retraction homomorphism is actually 
contained in the subgroup Q/Z of M/Z. 

3.2. The Neron Function. Let A„ : E{C V ) \ {O} -> R denote the Ncron local 
height function, as defined and normalized in |27], §VT.l. In order to describe this 
function explicitly, we let Eo(C v ) denote the subgroup of E(C V ) consisting of those 
points with "non-singular reduction" . More concretely, if E has good reduction 
then E (C V ) = E(C V ), and if E has multiplicative reduction, then E (C V ) = ker(r) 
is the kernel of the retraction homomorphism. 
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Definition. The Neron function is defined as follows: 

(i) If Pe E Q (C v )\{0}, then 

x v (p) = io g + \zipyx + i og + \ JE \ V 

= ilog + | 2 ;(P)|. i , + ^log + \j E \ v , 

where x and y are the usual coordinate functions associated to a Weierstrass 
equation for £ over O v , and z — —x/y is the standard local parameter at 
the origin. 

(ii) If P £ E(C V ) \ E (C V ) is a point with singular reduction, then 
A„(P) = ±B 2 (r(P))log+ \j E \ v , 



where 



2 



l -V 2 (t)= l -(t-[t\Y- l -{t-[t]) + ^ 



- £ 1 



(2nk) 2 

fcez\{o> 

is one-half the periodic second Bernoulli polynomial. 

Note that log + \Je\v — when E has good reduction, and log + \je\v = — log \q\ > 
when E has multiplicative reduction. 

Since this definition of A„ may appear rather ad-hoc, we offer the following 
alternative description. Following |7|, there is a decomposition 

X v (P-Q) = i v (P,Q)+j v (P,Q), 

valid for all pairs {P,Q) S -E(Cu) x E^C,,) with P ^ Q, where i v is a local in- 
tersection term, and j v is a function which factors through the retraction map 
r : E(C V ) -> R/Z. 

When E has good reduction, i v (P, Q) is (up to a constant multiple) the scheme- 
theoretic intersection multiplicity of P and Q on £, which in concrete terms means 
that 

i v (P,Q) =log + \z(P-Q)- 1 \ v 

When E has multiplicative reduction, one can define i v (P, Q) using a Tate uni- 
formization via the formula 

. mn ,_S r(P)^r(Q) 
V[ F ' Q >-\ -logll-SjgU r(P) = r(Q), 

where u(P),u(Q) S C* are representatives of the classes of P and Q, respectively, 
in C*/q z , chosen in such a way that |w(P)|„ = (which is possible precisely 

when r(P) — r(Q)). 

It is easy to see that in both cases, i v (P,Q) is symmetric in P and Q, and 
provides a natural measure of how u-adically close P and Q are. 

When E has good reduction, j v (P,Q) is identically zero. When E has multi- 
plicative reduction, we define 

j„(P,Q) = iB 2 (r(P-Q))log+ \j E \ v . 
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Note that j v (P, Q) is also symmetric in P and Q, and that j v {P, Q) = \3e\i 
if r(P) — r(Q). For those familiar with the language of metrized graphs (see [7], 
[2], and PP), we point out that g(x,y) — ^B2(r(P — Q)) is the unique normalized 
Arakelov-Green's function for the Haar measure \i on the circle R/Z (and so the 
function j v (P, Q) is not as arbitrary as it might appear) . 

3.3. The reduction homomorphism. We can use the w-adic intersection func- 
tion i v (P,Q) to define the reduction E of E and a reduction homomorphism 
7r : E — > E without explicit reference to models in the multiplicative reduction 
case. 

Recall that i v (P, Q) is a symmetric, nonnegative, real- valued function of P and 
Q, defined for all P,Q € E(C V ) with P ^ Q, which measures the w-adic proximity 
of P and Q. We extend i v {P,Q) to all P, Q G E(C V ) by defining i v (P,P) to be 
+00. 

Now define a relation ~ on E(C V ) by declaring that P ~ Q if and only if 
i v (P,Q) > 0. It is easy to verify that ~ is an equivalence relation. Intuitively, 
P ~ Q iff P and Q are congruent modulo the maximal ideal of C v . We therefore 
define the reduction 23 of P to be the set of equivalence classes under ~. If we denote 
the equivalence class of P € E(C V ) by [P], then there is an obvious reduction map 
7T : P -> P given by tt(P) = [P]. The binary operation given by [P] + [Q] = [P + Q] 
is easily verified to be well-defined, and furnishes P with a natural abelian group 
structure. The reduction map from P to E is then a group homomorphism. 

When P has good reduction, and £ is a model over the ring of integers in C„ 
for P whose special fiber £ is smooth, it is easy to see that P = £(¥ v ) as abelian 
groups. 

The more interesting case is when P has multiplicative reduction. In this case, 
one has the following alternate description of the equivalence relation ~. By (|20|l . 
the kernel of the retraction homomorphism r : P(C„) — > Q/Z is isomorphic to 

{«£C„ : |u| = l} = 6;. 

If p : ker(r) — > <D* V — > F„ denotes the isomorphism of ker(r) with O* followed by 
the natural map from O v to its residue field, one easily verifies that P ~ Q if and 
only if r(P) = r(Q) and p(P — Q) = 1. From this description, it follows easily that 
there is a commutative diagram of abelian groups 

1 ► 6* > E(C V ) — — > Q/Z ► 

I 1' II 

1 > F* ► P ► Q/Z > 

r 

(Compare with ^H], §11, Expose IX when C„ is replaced by a discretely valued 
field.) 

As sets, there is a non-canonical isomorphism 
(22) P~Q/ZxF*. 

Indeed, E is the disjoint union over all t e Q/Z of E t — {P e E : r(P) = t}, 
and each set E t is non-canonically isomorphic to ker(f) — F^ by translation. More 
precisely, each Et is a principal homogeneous space for F„. 
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3.4. Discrepancy. We can extend the Neron function to a function A* : E(C V ) 
R denned on the entire curve by setting 



(23) \* V (P) = 



X V (P) xiP + O, 

^\og + \j E \ v if P = 0. 

Replacing A„ by A* is in some sense analogous to the convolution (|14|l with the 
heat kernel in the archimedean case. One evident difference, however, is that A* 
is discontinuous, whereas the convolution (jl4|l smooths out the singularity at the 
origin. If we define i* v : E(C V ) x E(C V ) — > R by the formula 

i* v (P,Q) 

then 

(24) X* v (P-Q)=i* v (P,Q)+j v (P,Q), 
valid for all pairs (P, Q) S E(C V ) x 




Now, given a set Z = {Pi, . . . , Pn} C E(C v ) of N points, we define the local 
discrepancy D(Z) of this set by 

(25) D{Z) = h £ K(P*-P 3 )- 

l<i,j<N 

Using H24|) . we have a decomposition 

(26) D(Z) = D i (Z)+D j (Z), 
where 

(27) D l {Z) = ^ ]T it(Pi,Pj) 

l<i,j<N 

is the congruence discrepancy, and 

l<iJ<N 

is the retraction discrepancy. 

Clearly Di(Z) > 0, and we will see in a moment that Dj(Z) > as well. Note 
that if E has good reduction, then Dj(Z) = by definition. So the retraction 
discrepancy is only relevant in the bad reduction case. 

The next two results illustrate the fact that when E has multiplicative reduction, 
the retraction discrepancy Dj(Z) is a measure of the /x-uniform distribution of the 
retraction r(Z) of the set Z, where fi is Haar measure on the circle group R/Z. 

Proposition 5. Let Z = {Pi, . . . , P/y} C E(C V ) be a set of N distinct points. If 
E has multiplicative reduction, we have 



(28) D j (Z)=log+\j E \ v x ' 



^ (27rfc) 2 



^ N 



2 



> 0. 



Proof. The formula for Dj(Z) follows immediately from Parseval's formula and the 
Fourier expansion H2I(I of the second Bernoulli polynomial. The fact that Dj(Z) is 
positive is then an easy consequence of the explicit formula. □ 
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In addition, one has the following analogue for the circle group R/Z of Proposi- 
tionlHl For the statement, we define the discrepancy of a Borel probability measure 
v on R/Z to be 



D{v) = J J f{x-y)dv{x)dv{y), 

where f(t) = log+ 2 Us| B 2 (t). Letting v{k) = f R/z e- 2ntkt dis(t) and 

f o -w fO if k = 0, 

f(k) = / f(t)e- 2nlkt dt = },+,., 

we then have 

(29) i^H£/»|P(fc)| 2 

fcez 

by Parseval's formula. 

If E has multiplicative reduction and Z = {P\, . . . , P^} C E(C V ) is a set of iV 
distinct points, and if we define 



1 N 



i=l 

to be the natural discrete measure supported on the multiset r(Z), then it follows 
from the definitions that Dj(Z) — D{8z)- 

Proposition 6. If v n is a sequence of probability measures converging weakly on 
R/Z to v, then D(y n ) — * D(v). Furthermore, D(i>) = if and only if v is the 
normalized Haar measure \x on R/Z. 

Proof. The convergence of discrepancies follows from the continuity of f(x — y) 
and the fact that v n x v n converges weakly to v X v on R/Z x R/Z. The last 
statement follows from (|29|) and the fact that f(k) > for all k € Z \ {0}, since [/, 
is characterized by the fact that ju(fc) = for all k € Z \ {0}. □ 

Finally, we have the following result, which is a non-archimedean analogue of 
Proposition 

Proposition 7. Let Z = {Pi, . . . , Pjy} C E(C V ) be a set of N distinct points. 
Then D(Z) > and 

(30) U^~P 1 )^N 2 D(Z)-^\og+\j E \ v . 

l<i,j<N 

Proof. The non-negativity of the discrepancy follows from 126(1 and the fact that 
Di(Z) and Dj(Z) are non- negative. The identity (|30|l follows at once from the 
definition of A*. □ 



4. Global Discrepancy 
Let k be a number field, and let Ejk be an elliptic curve with j-invariant je- 
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4.1. Definition of the global discrepancy. In this section, we define the global 
discrepancy D(Z) of a set Z = {Pi, . . . , Pn} C E(k) of algebraic points. 

Let K be any number field such that Z C E(K) and E/K is semistable. If v is a 
place of K, then E is defined over K v , and we can view Z as a subset of E(C V ) via 
the embedding If <^-> C„. We let D V (Z) denote either the complex local discrepancy 
defined in l(15j) . or the non-archimedean local discrepancy defined in 125(1 . We now 
define the global discrepancy 



v£Mk 

It is straightforward to show that this quantity is an absolute diophantine invariant 
of the data k, E, and Z, and doesn't depend on the choice of the number field K. 

Finally, we recall the definition of the logarithmic absolute Weil height h : Q — > 
[0, +oo): given a £ Q, select a number field K containing a, and define 

Again, as is well known, the value of h(a) does not depend on the choice of K. 



4.2. An upper bound for the global discrepancy. Given a set of algebraic 
points Z = {Pi, . . . , Pn} C E(k), define its height 



1 N 



3 = 1 

to be the average of the height of its points. 

Theorem 8. Let Z = {Pi, . . . , Pn} f= E(k) be a set of N distinct algebraic points. 
Then 

(32) V(Z) < 4ft(Z) + l(i logAT+i^ + H 

Proof. Let K be any number field such that Z C E(K) and E/K is semistable. 
Let d = [K : Q], and for each place v of K, let <i„ = [K v : Q„] and 

(33) A*(Z) = -L ^ A^Pi-Pi). 



A 2 

l<i,j<JV 

If u is an archimedean place, then 

A„(Z) > DJZ) - - — log+ \j E L - — 

by Proposition 01 If v is a non-archimedean place, then 
(34) A V (Z) = D V (Z) - log+ \ JE \ V 



14 



MATTHEW BAKER AND CLAYTON PETSCHE 



by Proposition |3 Summing over all places of K, we have 
A(Z)= £ ^K(Z) 

v£M K 

- V ' 2N 12N ^ d 6 u 1 5N 

v&Mk 

= V{Z) -^F-T2N h{jE) -5N> 

where we have used the fact that J2 v \oo ^ = d- 

On the other hand, by the parallelogram law, and the fact that the Neron-Tate 
height is nonnegative, we have h(Pi — Pj) < 2h(Pi) + 2h(Pj) for all 1 < i,j < N. 
In view of this and the decomposition 

(35) h(P) = £ ^K{P), 

v£Mk 

valid for all P E E{K) \ {0} (cf. |23, §VI.2), we have the upper bound 



v£M K 

= ^2 E to-i*) 

l<i,j<N 

(36) <^f E (2W + 2^)) 

l<i,i<iV 

* £ E to) 

l<i<JV 

= Ah(Z). 

Combining the upper and lower bounds for the sum A(Z), we deduce (|32l) . □ 



In g5J we will use Theorem[S]in conjunction with the following inequality relating 
local and global discrepancies: 

Proposition 9. Let Vq € Mk be a place of k, and let Z be a finite subset of E(k). 
Then D Va (Z) < [k : Q]V(Z). 

Proof. Select a Galois extension K/k containing k(Z), and over which E has every- 
where semistable reduction. Since Z is Gal(if/fc)-stable, we have D Vo (Z) = D W (Z) 
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for all places w € Mk that are Gal(i4T/fc)-conjugates of vo- Thus 

»w - E w ^' 

" ^ [K:Q] Davo{Z) 

(37) o-eGal(Jf/fc) L ^ J 

a£Gal(K/k) L J 

= [k : Q] _1 -D(2"). 



□ 



5. The Local Equidistribution of Small Points 

5.1. The Szpiro-Ullmo-Zhang equidistribution theorem for elliptic curves. 

In this section, we show how one can obtain from Theorem |S| an "elementary" (in 
the sense that it does not use Arakelov intersection theory) proof of the Szpiro- 
Ullmo-Zhang equidistribution theorem [2Hj for elliptic curves. 

Theorem 10 (Szpiro-Ullmo-Zhang). Let k be a number field, and let E/k be an 
elliptic curve. Fix an embedding k <— > C, so that E(k) C E(C). Suppose that {P n } 
is a sequence of distinct points in E(k) such that h{P n ) — > 0. Let S n be the Borel 
probability measure on E(C) supported equally on the set Z n of Gal(k / k)- conjugates 
of P n . Then 5 n — > \x weakly on E(<C), where fi denotes the normalized Haar measure 
on E(C). 

Proof. By Proposition|31 it suffices to show that D(Z n ) — > 0, where D(Z n ) denotes 
the archimedean local discrepancy of Z n c E(C). Let T>(Z n ) denote the global 
discrepancy of the set Z n C E(k). Then D(Z n ) < [k : Q]V(Z n ) by Proposition 
Since the points P n are distinct and h(P n ) — *■ 0, it follows from the Northcott 
finiteness principle that \Z n \ — > +oo. (The Northcott fmitcncss principle is the 
statement that for any M > 0, there are only finitely many points P with given 
degree and h{P) < M .) Also, h(Z n ) = h(P n ) — > by the Galois-invariance of 
the canonical height. Therefore, from Theorem [S] we deduce that D(Z n ) < [k : 
Q]V(Z n ) — * 0, which completes the proof. □ 

5.2. The Berkovich analytic space associated to an elliptic curve. In the 

next section, we will prove a non-archimedean version of the Szpiro-Ullmo-Zhang 
theorem for elliptic curves. The natural context for the result is the Berkovich 
analytic space attached to an elliptic curve, so we first recall some basic facts from 
§IV (see also §7.2] and [3D])- To make our results more accessible, we 
summarize all of the properties of Berkovich analytic spaces needed to understand 
our proof. 

Let E be an elliptic curve defined over C v . The Berkovich analytic space 
-Epierk = ^Berk.u over C v associated to E is a compact, Hausdorff, and path- 
connected topological space which contains E(<C V ) (endowed with its usual ultra- 
metric topology) as a dense subspace. We define the skeleton S of -Eeerk (c.f. © 
to be a single point when E has good reduction, and to be the circle R/Z when E 
has multiplicative reduction. The skeleton S can naturally be viewed as a subspace 
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of i?Bork- Moreover, there is a strong deformation retraction r : EBcrk — * E which 
extends the "retraction homomorphism" discussed in In particular, when E 
has good reduction, the space -Ebci-Ic is contractible, and when E has multiplicative 
reduction, its fundamental group is isomorphic to Z. 

Each connected component of _E(C„)\E is homeomorphic to an open Berkovich 
disk, which can be given the structure of an infinite real tree (or, in Berkovich's 
terminology, a simply-connected one-dimensional quasipolyhedron) , see |24| . 0, 
|3U| . and We may thus think of i?Bork as a family of infinite trees glued 

together along the skeleton E. 




Figure 1. The Berkovich space associated to an elliptic curve 
with multiplicative reduction. 

More precisely, the complement of E in -Eeerk is a disjoint union of connected 
open sets 

-EBerk\£ = H Q ei?B Q , 

with each B Q homeomorphic to an open Berkovich disk. We have B Q ni?(C 1 ,) = B a , 
where B a = {P G E(C V ) : n(P) = a}. When E has multiplicative reduction, the 
closure of B Q in i?Bcrk is B Q U {f (a)}, where we identify f (a) e Q/Z with its image 
in E = R/Z C iiBcrk- The set of balls B a whose closure meets a given point t € E 
is either empty (if t $ Q/Z) or else forms a principal homogeneous space for F„ (if 

teQ/Z). 

We now define a canonical probability measure /i = ji v (analogous to normalized 
Haar measure in the archimedean case) which is supported on S C -©Berk- 

Definition. The measure fi is defined as follows: 

(i) If E has good reduction, \i is the Dirac measure concentrated on the skeleton 
E, which consists of a single point. 

(ii) If E has multiplicative reduction, fj, is the uniform probability measure (i.e., 
Haar measure) supported on the circle E = K/Z. 

Remark 38. There is a natural way to define a Laplacian operator on -Eecrk (see 
|3l)j ) . and also to extend the function g(P, Q) = X V (P — Q) from E(JC V ) to £?Berk- 
Having done so, one can show that g(P, Q) is the unique normalized Arakelov- 
Green's function on i?Berk with respect to the measure [i (compare with H2.2fl . 

For each a € E, fix a point Q a € E(C V ) with Tr(Q a ) = a. For < r < 1, let 
B + (a, r) C B Q be the closure in i/Berk of the set 

B+(a,r) = {PeE(C v ) : i v (P,Q a ) > log(-)}. 
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Then 

B a = U r<1 B+(a,r), 

and we have 

(39) P,Q£ B+(a,r) ^ i v {P,Q) > log(-). 

r 

5.3. A Berkovich space analogue of the Szpiro-Ullmo-Zhang theorem for 
elliptic curves. In this section, we prove our non-archimedean version of the 
Szpiro-Ullmo-Zhang theorem for elliptic curves. 

Let Z C E(C V ) be a subset of cardinality N — N(Z). For each a £ E and each 
< r < 1, define 

N atr (Z) = \ZnB+(a,r)\. 

Also, define 



D ilT (Z) = £ 



N a , r (zy 2 

N(Z) 



The following result, which relates Di^ r (Z) to the congruence discrepancy Di(r) 
defined in 1)2 7[). is in some ways analogous to Proposition 0] 

Proposition 11. Let Z = {Pi,...,Pjv} C E(C V ) be a set of N distinct points, 
and let < r < 1 . TTien 

(40) A,^)<^ + i 

Proof. Recall from (|HJ) that if P,Q £ B+{a,r), then i v (P,Q) > log(i). By 
definition, there are N a ^ r {Z) points of Z in P + (a,r), and therefore there are 
N a , r (Z)(N a . r (Z) - 1) pairs (P, Q) with P, Q e Zfl P+(a,r) and P ^ Q. It 
follows that 



(41) 



and thus 



N 2 D t {Z) = *v{P,Q) 

P,Qt=z 

> log(l/r) ^ N a , r (Z)(N a<r (Z) - 1) 



^ffi >iV 2 A,r(^)-ViV a , r (g) 

(42) l0 S(") a ^ 

> N 2 D hr (Z) - N, 

which gives the desired result. □ 

For each n > 1, let Z n C E(C V ) be a set consisting of iV n distinct points, and 
let S n denote the probability measure on Peerk supported equally at the elements 
of Z n . Note that by definition, we have 

r i-o + f \\ ^a,r\Z n ) 



18 



MATTHEW BAKER AND CLAYTON PETSCHE 



Proposition 12. (a) Suppose < r < 1. If Hindoo D^ r {Z n ) = 0, then 

V (Z ' 
AW 



lim jVa 'I ,(Z " ) = 



for all a G E 

(b) Suppose that for all a £ E and all < r < 1, we /iave 

N ar (Z n ) 
lim !-; — = 0. 

n— «x> N[Z n ) 

Then every subsequential limit v of {5 n } is supported on the skeleton £ C 

-Epjcrk ■ 



Proof. Part (a) follows from the fact that 

< liminf 



JVa,r(Z„) 



< lim sup 



N(Z n ) 
N a , r (Z n ) 



N{Z n ) 
< limsupDi jr (Z„) = 0. 

n — >oo 

To prove (b), fix a € E and < r < 1, and let ^4 = B + (a, r). Let ^ be any weak 
subsequential limit of the sequence {S n }. 

We claim that v(A) — 0. To see this, fix r' with r < r' < 1 and let B = 
B + (a, r'). Since the closure of the complement of B is disjoint from A, it follows 
from Urysohn's Lemma (which is valid for every compact Hausdorff topological 
space) that we can choose a continuous function / on £Bcrk such that / = 1 on A, 
f = outside B, and < / < 1 on B. Then 



(43) S n (A) < / fdS n <5 n (B) 
and 

(44) v(A) < I f dv< v{B). 

J E Berk 

By hypothesis, lim n ^oo $n(A) = lim n ^oo S n (B) — 0. Therefore 

lim / fdS n = Q 



n — >oo 



by (|43|l . Since linXn^oo J E f dS n = J E ^ f dv by weak convergence, we have 
J E f dv = and therefore v(A) — by 144|l . This proves the claim. 
Thus i/(B a ) = for all a S E, since B a = U r <iB + (a, r). 

Since E is countable by |(2!2J|, we have f(£7) = as well, where U = U agJ jB a , so 
that U is contained in the complement of the support of v. But U ag £;B Q = i?Berk\S, 
so f is supported on £ as claimed. □ 

From these results, we deduce the following non-archimedcan analogue of Propo- 
sition |3 for the local discrepancy defined by H25|) : 

Corollary 13. //linin^oo D(Z n ) = 0, then S n converges weakly to fi on -Eeerk- 
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Proof. Since i?Berk is compact, after passing to a subsequence if necessary, it suffices 
to prove that if S n — ► v then v = fi. By l|26|) . there is a decomposition D(Z n ) = 
Di(Z n ) + Dj(Z n ) into nonnegative terms, and thus liron^oo Di(Z n ) = Dj(Z n ) = 0. 
Since Di(Z n ) > for all n, and since linin^oo Di(Z n ) = 0, Proposition II II implies 
that linin^oo Di jr (Z n ) = for every < r < 1 and every a £ E. It follows from 
assertions (a) and (b) of Propositionll2lthat v is supported on S. If £ consists of a 
point, then we're done. In the case where £ is a circle, Proposition Eland the fact 
that linin^oo Dj(Z n ) = show that D(v) = and hence v = fi. □ 

We apply these observations to obtain the following non-archimedcan version of 
the Szpiro-Ullmo-Zhang equidistribution theorem for elliptic curves, which gener- 
alizes a result of Chambert-Loir [H]) (see also pQ). 

Theorem 14. Let k be a number field, and let E/k be an elliptic curve. Fix a non- 
archimedean place v of k, and an embedding of k into C v (this allows us to consider 
E(k) as a subset of E-q c ^ v ). Suppose that {P n } is a sequence of distinct points 
in E(k) such that h(P n ) — > 0. Let S n be the Borel probability measure on -E/Berk.u 
supported equally on the set Z n of Gal(k / 'k)~ conjugates of P n . Then S n — * fJ-v weakly 
on EBcrk,v ■ 

Proof. By Theorem |H1 and Proposition El our hypotheses imply that the local dis- 
crepancy D(Z n ) tends to 0. Thus 6 n -> fi v by Corollary IT31 □ 

Remark 45. For elliptic curves with multiplicative reduction, Chambert-Loir |S] 
does not formulate his results in terms of the Berkovich analytic space -EBcrk; rather, 
he proves (in the terminology of H3.4|) that <5 r (z„) converges weakly to /i on the circle 
group S. In this sense, our result is more general than his. On the other hand, in 
the good reduction case, Chambert-Loir proves a generalization of Theorem 1141 to 
the Berkovich space attached to an arbitrary abelian variety. 

6. Quantitative Applications 

The general idea behind the results in this section is that, under certain special 
conditions, a set of global points cannot be too uniformly distributed at all places. 
Specifically, we will use Theorem |51 to deduce quantitative upper bounds for the 
number, and lower bounds for the height, of such points. We begin by establishing 
two useful lemmas. 

Lemma 15. Let N > 1 satisfy the bound N < AlogN + B for constants A > 
0,B>0. Then N < (e/(e - l))(Alog A + B) < f (A log A + B). 

Proof. Let x > 1 and y > e be real numbers, related by the equation y = x/\ogx. 
Note that 

X 

y log y = i — (log x - log log x) 

log a; 
= A _ log log x \ 
V log X I 

^--^ 

and therefore x < (jzr)2/logy. We will apply this inequality with x = Ne B ' A > 1. 
By our assumption that N < AlogN + B, we have Ne B/A < Ae B/A \og(Ne B/A ), 
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which implies that y = x/\ogx < Ae B l A . We then have 

Ne B ' A = x 



< (-^ Ae B/A Xog{Ae B/ A) 

= (-^)e B ' A (A\ogA + B), 



which concludes the proof. □ 
Lemma 16. Let L be a subfield of Q, and let E/L be an elliptic curve. If 

\{P € E(L) | h(P) < S}\ < T 
for constants S > and T > 1, then 

(*) \E(L)tor\ < T, 

(ii) Uminf{ft(P) | P e E{L)} > S, and 
(Hi) h(P) > S/T 2 for all non-torsion points P G E(L). 

Proof. The bounds (i) and (ii) are trivial. In order to see (Hi), consider a non- 
torsion point P e E(L), and let M > 1 be the largest integer such that h((M — 
1)P) < S. Thus h(jP) < S for all < j < M - 1, and it follows from the 
hypothesis that M < T. By the maximality of M we have M 2 h(P) = h(MP) > S, 
and therefore ft(P) > S/M 2 > S/T 2 . □ 

6.1. Small Totally Real Points. 

Theorem 17. Let Q tr denote the maximal totally real subfield o/Q, and let E/Q tr 
be an elliptic curve with j-invariant ■ Then 

(46) \E(Q tr ) tor \ <3h*(j E ) 2 ; 



(47) liminf{/i(P) | £;(Qt r )} > 



2Ah*(j E y 

and if P £ E(<Q tr ) is a non-torsion point, then 
(48) h(P) > 



216/i*(j £ ) 5 ' 

where h*(]E) — h(j E ) + 10. 

Before we begin the proof of this result, let us first recall a few facts about elliptic 
curves over R. Given an elliptic curve E/R, there exists a unique clement r in the 
set 

(49) T= {it | t> l}u{^+it | t> 

such that j(r) = j E - Letting q = e 27TlT e K, we have isomorphisms 

(50) C*/q z C/L £7(C); 



SMALL POINTS ON ELLIPTIC CURVES 



21 



the first map in (|50(l is the inverse of the exponential map 



while the 



second map in (|50|> can be given explicitly in terms of Weierstrass functions. These 
maps restrict to isomorphisms 



(51) 
where 



7<z z 



K 



E(R), 



n = {z e C/L | e iz}. 



(Note that, while $t(z) is not well defined on C/L, the property 5R(z) S |Z is well 
defined, since 3?(r) G iZ). In particular, L(M) has either one or two connected 
components according to whether 3?(r) = 1/2 or 3?(r) = 0, respectively; or equiv- 
alently whether g < or q > 0, respectively. For more details on these assertions, 
see E3, §V.2. 

Lemma 18. Let E/M. be an elliptic curve, and let r = a + bi 6 T be chosen so that 
j(r) = ]e- If Z = {Pi, . . . , Pn} C E(J3L) is a set of N distinct real points, then we 
have the lower bound 

b 



D(Z)> 



-S,ir\T\ 2 /bN 



4tt|t| 2 

for the complex discrepancy of the set Z . 



Proof. Let {zi, . . . , z^} C C be a set of coset representatives for the pullback of 
the set Z under the second map in l|50l) . Thus Zj = r\j + T2jt, where r\j G for 
all j = 1,...,N, by l|51|) . Let S denote the probability measure on C/L assigning 
a mass of 1/N at each point Zj. Recall that the dual group Te is parametrized by 
the lattice L under the correspondence J7J). In particular, if u — n\ + S L is a 
lattice point with n\ even and ni = 0, then 



nzZ^( z ^ 

3=1 
1 N 

-Y 



= l. 

We apply this for oj = ±2 + Or, in which case u/ 
_£_^ e -27rt|u/|76 by ^ an d JJ2J ! we then have 

D(Z)= AT(7)I%)| 2 

> (7(2+0r)) + (7(_ 2 +0t) 

6 



=F2r. Since Xtfru) 



47r|r| : 



which is the desired inequality. 

Proof of Theorem \17\ Let us write h 
Lemma Il6l it suffices to show that 



-8ir\r\ 2 /bN 



h{jE) and h* = h + 10. 



□ 

In view of 



(52) 



P € L(Qtr) I HP) < 



24ft,* J 



< 3(/i*) 2 



22 



MATTHEW BAKER AND CLAYTON PETSCHE 



Let k C Qtr be a number field over which E is defined, and let Z = {Pi, . . . , Pn} C 
E{Q tI ) be a set of N totally real points with h(Pj) < l/2Ah* for all 1 < j < N. 
Select a number field K, containing k(Pi, . . . , Pjv), such that E/K is semistable. 
Denote by d = [K : Q] and d v = [K v : Q„] the global degree and local degrees of 
K, respectively. 

Let v be an archimedean place of K, corresponding to an embedding a : K c — > C. 
Since k is totally real, we have cr(fc) C R, and therefore we can view E as an elliptic 
curve defined over R, with Z C E(M). Select t v — a v + b v i in the set T defined in 
(|49f) . with <j(jE) = j( T v)- By Lemma IT%1 we have 

(53, ^^*(^). 



2e~ 



where </>(x) 

In order to assemble these local estimates at the archimedean places we will 
apply Jensen's inequality 

(54) ^ w v 4>(x v ) > 4>{^ ^ Wv%v) 

V V 

to the convex function <f>(x); here {w v } is a finite set of positive weights with 
J"V w„ = 1, and the x v are arbitrary positive numbers. Applying (|54|l with w v = 
d v /d and x v = 8n\r v \ 2 /b v N , and using the local lower bound l|53(l . we have 

v(z)> J2 

~ N ^ d \ b v N ) 

(55) v ^ m k 



2 



where t/v = Ah* /N . The last inequality in (|55|l requires an explanation: first note 
that since a v £ {0, 1/2} and b v > 1/2 for all archimedean places f of K, we have 

8tt|t„|7^ = 8n{a 2 v + bl)/b v 

< 8nb v + An. 

Therefore, by Lemma 1241 of Appendix 1X1 we have 

Ed v 8tt\t v \ 2 1 ^ d v . . 



4 E ^(41og + b £ |„ + 24 + 47r) 



vEM% 

< (Ah + 2A + An) /N 

< Ah* /N, 

which, in combination with the fact that <p(x) is decreasing for x > 0, establishes 
the last inequality in (JHHJ- 
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Now, combining the lower bound Ij55(l on T>(Z) with the upper bound on T>(Z) 
given by Theorem |H1 we have 

(56) — e~ tN < — + -logiV + — h+ — , 
v ' t N ~ 6ft* 2 8 12 5 ' 

since h(Z) < l/24ft*. We will use this inequality to show that 

(57) N < AlogN + B, 

where A = 2ft*, and B = 4ft*(^ft + ^). First, note that 1^3 is trivial if N < B, so 
we may assume that N > B > 4-10-^ = 128 (note that ft* > 10). Ift N > log(6/5), 
then 

4ft* 

N < , ,n,rs < ( 2 lQ g( 128 ) + 64/5)ft* < A log N + B. 
log(6/5) 

On the other hand, if tjv < log(6/5), then by (|56[1 we have 

5iV _ 5 
12ft* ~ 3^7 

< —e~ tN 

tN 

N 1 , *r 1 , 16 

^6F + 2 l0gjV+ i2 ft+ y 



Therefore 



^ 1 , » T 1 , 16 
< -log AH h-\ 

Ah* ~ 2 B 12 5 



and IJ57J follows. 

Finally, combining l|57|) . Lemma Il5l and some elementary calculus, we have 



N < -^-{ih* \og(2h*)+Ah*{^-h+^- 
e — 1 V 12 5 

/ 1 142 \ 

-(2ft*log(2ft*) + Ah* (-ft* + —)) 



e 

e ( 2\og{2h*) 1 5gg_W a 
lV ft* + 3 + 60ft*J l j 



e 



e f 21og(2(10)) 1 568 N a 
S e-lV 10 + 3 + 60(10) J l j 

<3(ft*) 2 , 

since ft* > 10. This establishes (I52II . and completes the proof of the theorem. □ 

6.2. Small Cyclotomic Points. In [3T], S. Zhang notes that the Szipro-Ullmo- 
Zhang equidistribution theorem |2H], together with a restriction of scalars argument, 
yields the following result: If k is a totally real number field and A/k is an abelian 
variety, then there exists an e > such that h(P) > e for all but finitely many 
points P £ A(k(fj, 00 )), where fc(^too) denotes the maximal cyclotomic extension of 
k. The key point in the proof is that the maximal totally real subfield k(fi oa ) + of 
MMoo) has index 2. In this section, we prove an explicit quantitative version of this 
result when A = E is an elliptic curve. 



24 



MATTHEW BAKER AND CLAYTON PETSCHE 



Theorem 19. Let k be a totally real subfield of Q, and let fe(/ioo) denote the 
maximal cyclotomic extension of k. If E/k is an elliptic curve with j-invariant j E , 
then 

(58) |£(MMoo)WI <36h*( ]E ) 4 , 



(59) liminf{/i(P) | PeE(fc(/i m ))}> 

and if P € i?(/c(/ioo)) is a non-torsion point, then 

1 



1 



^ ^ 864/^' 

where h*(j E ) = h(j E ) + 10. 

Proof. Suppose E/k is given in Weierstrass normal form by the equation y 2 = f(x). 
If t e Gal(fc(/Xoo)/fc) is any complex conjugation automorphism, then (fc(^oo)) r = 
k{Hoa) + is a totally real field. (This follows from the fact that Gal(fc(/ioo)/fc) is 
abelian.) 

Let P e E{k{pL tx )), and define P 1 = P + P T , P 2 = P - F r . Then P ± is totally 
real (i.e., P{ = Pi) and P2 is totally imaginary (i.e., P 2 = — ft)- 

Note that the totally imaginary points on E are in bijection with the totally real 
points on the quadratic twist E' of E defined by the Weierstrass equation 

E> : y 2 = -f(x). 

Note also that j E > — j E , since E and E' are isomorphic over k(y/— 1). 

In order to prove (|58J) . note that by Thcorem ll7l there are at most M ~ 3h*(j E ) 2 
totally real torsion points on E. Applying the same result to E' , and using the fact 
that j E i — j E , we see that there are at most M totally imaginary torsion points 
on E as well. If P £ E(k(/j, 00 )) tol , then as 2P — Pi + P 2 , it follows that there are 
at most M 2 possibilities for 2P, and thus at most 4M 2 possibilities for P. This 
proves the bound l|58ll . 

We now turn to the proof of l|59l) and (|60|l . First, we claim that given a non- 
torsion point P G Elklpac)), either there exists a non-torsion totally real point Q 
on E with h(P) > h(Q)/A : or there exists a non-torsion totally real point Q' on 
E' with h(P) > h{Q')/A. To see this, note that by the parallelogram law we have 
h{Pi) + h(P 2 ) = 2h(P) + 2h(P T ) = Ah{P) > 0. Therefore either h{P x ) > 0, in 
which case we take Q — Pi; or else h{P2) > 0, in which case we take Q' = P 2 , the 
image of P 2 under the isomorphism E ~ E' . 

In view of this claim, l|59|l follows immediately from l|47() . and (|60() from (|48|l . □ 

6.3. Small Totally p-adic Points. Let p be a prime number. A subfield L of 
Q is called totally p-adic if the rational prime p splits completely in every number 
field L' C L, or equivalently, if every embedding of L into Q p has image contained 
in Q p . 

The following result is an analogue for elliptic curves of a result of Bombieri and 
Zannier [S] for the multiplicative group: 

Theorem 20. Let k be a number field, let p =/= 2 be a prime number, and let E/k 
be an elliptic curve having semistable reduction at all places of k lying over p. If 
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L/k is totally p-adic algebraic extension, then 

(61) \E(L) tor \ < _^_(lo g M + 21ogp + At^)), 

5 logp V / 

(62) liminf{MP) | P G E{L)} > 
and if P £ E(L) is a non-torsion point, then 

(63) h(P) > H {^lf) 3 ( log M + 2 logp + fc+ (j B )) ' , 
where 

1 32 

M = max{p + 1 + 2VP, 12i/}, ^(je) = + y , 

and is £/ie maximum over all places w S Mfc Zg/OTp over p o/ i/ie quantity w + (Je) 
max{0, -ord^^E)}. 

Proof. In view of Lemma 1161 it suffices to show that 



(64) \{PeE(L) | ^(P)<^}|<_(logM + 21ogp +g ^) + T 

Let Z = {Pi, . . . , Pat} be a distinct set of points of E(L), with h(Pj) < *§f* for 
all 1 < i < and choose a finite extension if of fc over which all points of Z are 
defined. Let v be a place of K lying over p, and note that if (3 £ K is a nonzero 
element with \(3\ v < 1, then in fact \f3\ v < p^ 1 . 

Suppose first that E has good reduction at v, and let E be its reduction, which 
is an elliptic curve over the residue field F„ of 0k at v. Setting r = p^ 1 , we recall 
from 100} that 

A(^)>(A,r(^) + ^)log(l/r). 

Let m = |i?(F„)|, which by Hasse's theorem satisfies the inequality m < p + 1 + 
2y/p. Then by the pigeonhole principle, we have 

/AT t- \ \ 2 

D hr (Z)= 



N r (a) 



> E 

q£E(F„ 
1 



N 

(N/m 
{~7T 



rn 

Thus 

Now suppose that E has multiplicative reduction at v, and let v v = — ord-u(js) > 
1. Recall that the retraction homomorphism 7- : E(K V ) — > M/Z factors as 

£7(Jf ) -> if*/g z -> M/Z, 

where q S if* with \q\ v = here the first map is the Tate parametrization, 

and the second map is given by u^> log \u\ v / log (cf. H3.1(l . Note that Im(r) = 
(l/is v ) C M/Z, and therefore e 27rmr(p ' ) = 1 whenever i/„ | n. Letting 
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denote the local retraction discrepancy as defined in Section EH it follows from 
Proposition that 

D(Z)> l0g|j£|t ' V — — V e Mnr ^ 2 



> 



nSZ\{0} 1<3<N 



E 



47T 2 

(65) nei^„z\{o} 
_ log Us I 



12^ 
logp 



I2u v ' 

Using the definition of the global discrepancy and of the constant M, together 
with the fact that 

Ed v _ 1 
1 ~ ' 

we see in all cases that the global discrepancy T>(Z) of Z satisfies 



Combining this with i|32|) . we find that 
(66) 



— - — 1 logp < 4/i(Z) + — (-log AT + —h(j E ) + — 
M NJ &F ~ N\2 S 12 U J 5 



logo 1/1, , T 1 , , x 16 

< — — H (- log N-\ Mje) H 

- 2M AT V2 B 12 u y 5 

since < It follows that 

(67) N < ( log N + 2 logp + ifcfo) + f 

Applying LemmaHSlwith A = ^ and B = ^ (2 logp + \h{j E ) + f j,we obtain 

the upper bound N < |(Alog A + B). The inequality l|64|l follows upon noting that 
log log p > 0. □ 

By suitably modifying the above argument, one can establish the following gen- 
eralization of Theorem |20| we omit the details of the proof. For the statement, 
we say a subfield L of Q is totally p-adic of type (e, /) if for any embedding of L 
into Q p , the image of L is contained in a finite extension of Q p whose absolute 
ramification and residue degrees are bounded by e and /, respectively. 

Theorem 21. Let k be a number field, let p ^ 2 be a prime number, and let E/k 
be an elliptic curve having semistable reduction at all places of k lying over p. If 
L/k is an algebraic extension which is totally p-adic of type (e, f) and q = p' , then 

(68) \E(L) tor \ < ^Lflog(eM) + ^P+hHj E )), 

ologpv e / 

(69) liminf{MP) | P e E(L)} > |gg, 
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and if P £ E{L) is a non-torsion point, then 
w/iere 

_ 1 32 

M = max{a + 1 + 2^9, 12ei/}, fct(j K ) = -h{j E ) + y , 

and v is £fte maximum over all places w € Mfc fo/ing overp of the quantity w + (Je) — 
max{0, ~ord w (j E )}. 

Appendix A. A quantitative refinement of Elkies' theorem 

Our goal in this appendix is to give a proof of Proposition^] The proof is simpler 
than the one given in ^5], §VI, Theorem 5.1, in that we use only basic results from 
Fourier analysis, whereas the proof presented by Lang uses a number of nontrivial 
results concerning elliptic differential equations and eigenfunctions of the Laplacian 
on a Riemannian manifold. At the same time, our proof yields better quantitative 
information than one obtains in the general case by using certain explicit estimates 
for the modular j-function. We also include a discrepancy term in the estimate 
which plays an important role in this paper. 

For the reader's convenience, we recall the statement of Proposition 

Proposition |3J Let E/C be an elliptic curve with j-invariant je, and let 

Z = {P 1 ,...,P N }cE(C) 

be a set of N distinct points. Then D(Z) > 0, and 

E - ^) > ^D(Z) - ^fL - g log+ \ jE \ - f N . 

l<i,j<N 

Before giving the proof, we need a series of preliminary results. 
Lemma 22. The kernel g t is nonnegative. 

Proof. Let ip E S(E(C)) be a nonnegative test function, normalized so that ^(70) — 
Ie(c) ^( P )M p ) = 1, and dcnne u(P,t) € C*°°(£;(C) x [0,+oo)) by 

'g t *tp(P) if * > 
ip(P) if * = 0. 



u(P,t) 



= E kl)e- trKl h{P)- 

In view of Proposition [2] we see that u is a solution to the heat equation 

du 

m =Au - 

Such a function satisfies a "maximum principle," meaning it must take its extrema 
on the boundary E(C) x {0} (cf. ^3|> (4-16)). In particular, we have 

Mu(P,t) = inf u(P,0), 

PA P V ' 

and since u(P, 0) = ip(P) > 0, it follows that u(P,t) is nonnegative. If we now fix 
t > and take a sequence of such if> so that ip(P)dfi(P) converges weakly to a unit 
point mass at P = O, we have u(P, t) — > gt(P), and therefore gt(P) > 0. □ 
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Lemma 23. For P e E(C) \ {O} and t > 0, we have 

X(P)>X t (P)-t. 

Proof. We have 

X t (P) - A(P) = J §- s K(P)ds 

rt §~ s { E kf)9 s (7h(P))ds 
7sr E 



o 



7er E \{ 70 } 

-J*( E e - sA(7) 7(p))< 

7er B \{7o} 

- / g s (P)ds + l 
Jo 



since g s (P) > 0. □ 



We will also require the following estimate involving the modular j-function 
j : H -> C. 

Lemma 24. for t = a + W € i, we /lave 

(71) 2^6<log+|i(r)|+6. 

Proof. Let < i < 1 be a parameter to be chosen later, and put q = e 27 " T . If 
\q\ > t, then 

2-ko = — log \q\ 

(72) < log(l/i) 

<log+ |j(r)|+log(lA). 

On the other hand, suppose that |g| < t. Recall that j(r) = 1728g2(T) 3 /A(r), 
where A(r) = {2 / n) 12 q n„>i(l — q n ) 2A is the modular discriminant function, and 

.^r) = ^ (l + 240 ]T]T A""*) 

d>l m>l 
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is the Eisenstien series of weight 4 (cf. (2HJ , ch. VII) . We have 
log|A(r)/g| = 121og(27r) + 24^ log |1 - q n \ 



n>l 

kn 



= 12 log(27r) ~243t^2Y^ k ~ 1( l 

n>l k>l 

< 12 log(27r) + 24:J2Y1 k ^ 

n>l fe>l 

= 121og(2.) + 24^^ 



k>l 



24 

< 121og(2^) + — - t Y, k ~ hk 

k>l 

24 /In 
= 12]og(2,r) + — log (_). 

Also, using the inequality |1 + u\ > 1 — \u\ and the identity Yld>i ^ 3 t d — *(1 + 4i - 
t 2 )/(l — t) 4 , we estimate 

| ff2 (r)| = ^ 1 + 240 ]T£ A md 



12 

(2t^ 4 



rf>l m>l 
d>l m>l 



>^(l"240EE^ 

d>lm>l 



12 



d>l 

\4 



- 12 V 

d>l 

(27r) 4 ^ 240i(l + 4i + i 2 



12 V (1-t) 
Combining these two estimates, we have 

27r6 = — log |g| 
(?3) =log|j(r)| -log|1728 52 (r) 3 g/A(r)| 

, log+ , (T)l + ^ log (^)_ log(l _?i2fc|±i!) 

We now select £ to be the (only) root to of the polynomial f(t) = 240i(l + At + 
t 2 ) — (1 — t) 6 in the interval (0, 1). This choice, while perhaps not quite optimal, 
does simplify matters since it satisfies the identity 

,*as , /-, us i A, 240t (l + 4t +t%)\ 

(74) log(lAo) = -log(l ( \ _ -J- 

Therefore, in view of the two cases (|72[1 and Q73[l. and the identity Q74|). it follows 
that 

(75) 2^6 < log+ |j(r)| + log(l/t ) + log ( ' 



1 - t V 1 - t, 
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for all t = a + ib € H. Finally, using the fact that f(t) is increasing for < t < 1, 
it is straightforward to show that 1/250 < to < 1/249, and therefore 



log 



24 / 1 \ 24 / 1 \ 

fl An) + log I ] < log(250) + ; log ( ; ) 

W 0J l-t S VI - t / - BV ' 1 - 1/249 B VI - 1/249/ 



1/249 ° VI- 1/249 v 

< 6, 



which, along with l|75(l . finishes the proof of the lemma. □ 

Remark 76. In |16| . Hindry and Silverman prove the reverse inequality 2irb > 
log + |j'(t)| — 2.304 by a similar method. 

Lemma 25. Let E/C be an elliptic curve with j-invariant je- Then for < t < 1, 
we have 

(77) \ t (0) < \ log(l/i) + ^ log+ \j E \ + 11/5. 

Proof. Fix a parametrization -E'(C) ~ C/L, where L = Z + rZ is a lattice and 
r = a + bi is chosen in the usual fundamental domain 

f ={z | -1/2 < <$t(z) < 1/2, \z\ > 1} U {e* 9 | tt/3 < 9 < tt/2} 

for the action of the modular group on H. Thus je = j( r )> an( i n °te in particular 
that b > y/3/2. 

Given a nonzero lattice point u) € L, we have the associated (nontrivial) character 
7w gTb, as defined in Q. It follows from © and Proposition [21 that 

A(7c 



2tt|o;'| 2 ' 

where u/ is defined in (JSJl. 

Since u> i— > a/ is a permutation on L which fixes zero, we have 

MO) - E A( 7 )e-*^W 
7er B \{ 7o } 



E 



b 



2ir\u\ 2 

w€L\(0) 1 1 



-27rH 2 t/h 



< A V — + - V — e -^\ 2 t/b 

~ 2tt 4-; k 2 n V M 2 



where the latter is a Riemann-Stieltjes integral, and 

S(x)= e 1 



\cj\<y/x 



is the number of lattice points uj € L, with positive imaginary part, in the disc 
{|z| < An elementary lattice point counting argument provides a bound 

S(x) < ^x + \\fx. Also, it is clear that S(x) = for x < b 2 , since no lattice point 
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in L can have a positive imaginary part smaller than b. Integrating by parts, we 
have 

At (0) < ^ + h - [°° - e - 2 ^ b dS(x) 

S(x)( 1 + 2 £ x/b )e- 2 ^ b dx 

(78) + {^ X+ \ X y% l + 2 ^ X,h )e-^IH X 

1 + 2ntx/b^ „_ w 6 . , f°° f 1 + ^tx/b^ 



■nb 




/■CSC 


y ~ 




'o 


■nb 




I'OC 


y ~ 




'o 


Kb 




/■QO 


~6~ " 


^ 7T J 






/*°° 


~6~ " 


A 2 


V 









2x 



where Ji and J2 are intergals which we now estimate. First, upon making the 
change of variables u = tx/b 2 , we have 



1 t°° 

h = - x~\\ + 2Ktx/b)e~ 27rtx/b dx 

2 Jb 2 



u- 1 (l + 2Kbu)e- 2 * bu du 



< 



1 

-1 nl 1 /> CO /» CO 

-/ u- 1 e- 27rbu du+ - u^er^du + Kb e'^du 

2 Jt % Ji Jt 

-1 />1 1 />oo />oo 

-/ u- 1 du+- e- 27Tbu du + nb er 27Tbu du 

2 Jt 2 ,y 



Also, 



1, , M 1 1 
= 2 l0g(1/t) + 4^ + 2- 



h = 



n + 2Ktx/b y Mx/bdx 

V KX 1 / 

ti" Jb 2 " J b 2 



< A + ^ r e - Mx ' b dx 



Kb b 2 

3 



b 2 



< 
Kb 

Combining the estimates on these integrals with Lemma|2I and using the fact that 
b > VS/2 (since t € f), we deduce that 

A , ( o> < 1,^/0 + 1 + ^ + 1 

and C3 follows, since 13/271-^+1 < 11/5. □ 
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Proof of Proposition^ The positivity of the discrepancy follows at once from (|16ll . 
For t > we have 



A(Pi-P.,)> £ XtiPi-P^-NH 

l<i,j<N 

(79) 



l<i,j<iV Ki,j<JV 



= J2 h{Pi-Pj)-N\ t (0)-N% 

l<i,j<N 

by Lemma 1251 Selecting t = 1/N and using Lemma 031 we deduce the desired 
bound 



HP-P 3 )>N 2 D(Z)-NX 1/N (0)-N 



l<i,j<N 

(80) i^j 



> w * D(z) _^_g log+fo i_^. 



□ 
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